AMENDMENT UNDER 37 C.F.R. § 1.116 
U.S. Application No. 10/729,915 

AMENDMENTS TO THE CLAIMS 

This listing of claims will replace all prior versions and listings of claims in the 

application: 

LISTING OF CLAIMS: 

1. (CMrrenrfy^men(ie(i) A method for organizing a plurality of data files using 

meta-data wherein at least one meta-data element is associated with each data file, the method 
comprising: 

extracting, for at least some selected data files, at least one meta-data element associated 
•with a respective selected data file; 

organizing the extracted meta-data elements in a d e sir e d order into a single ordered set 
wherein the set is ordered consecutivelv based on values for one or more of the extracted meta- 
data elements : 

calculating pair- wise differences between values of each of the extracted meta-data 
elements; 

inputting at least one value of a clustering sensitivity parameter, said clustering 
sensitivity paranaeter defining granularity of the clustering, and multiplying each pair-wise 
difference by the clustering sensitivity parameter to obtain a plurality of similarity values for 
determining clustering based on the similarity values and at a granularity defined by 1he 
clustering sensitivity parameter; and 

dividing ihe selected data files into groups based on the similarity values. 
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2. Canceled. 

3. {Currently Amended) The method of claim 1, wherein determining the similarity 



the one pair consists of the data file and data file; 

5^ {i, j) is the similarity value for the data file and the 7* data file; 

K is the clustering sensitivity parameter value; 

ti is the value of a selected extracted meta-data element associated with the 
datafile; and 

tj is the value of a selected extracted meta-data element associated with the 
data file. 

4. {Currently Amended) The method of claim 1, wherein determining the similarity 
value of [[the]] a one pair comprises determining the similarity value of the one pair as : 



value of [[the]] a one pair comprises evaluating the similarity vdue as: 




where: 




where: 
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the one pair consists of the data file and/ data file; 
Sk (h j) is the similarity value for the data file and thejr* data file;" 
jST is the clustering sensitivity parameter value; and 

V; and. vj are ac?tual vector values determined firom the i*^ and the j*^ data files. 

5 . (Previously Presented) The method of claim 1 , fiirther comprising determining, 
for each of the selected data files, at least one novelty value hased on at least one similarity value 
of the selected data file and a nearby data file. 

6. (Previously Presented) The method of claim 5, wherein determining at least one 
novelty value comprises determining at least one novelty value as: 

vM=T.S^is-hl,s + n)g(l,n) 



where: 

v^, (j) is the novelty value; and 

g is a Gaussian tapered (21+1) x (2n+l) checkerboard kernel. 
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7. (Previously Presented) The method of claim 5, further comprising determining at 

least one boundary location between the selected data files based on the at least one novelty 
values determined for each of the selected data files. 

8. (Previously Presented) The method of claim 7, further coinprising determining, 
for at least the one determined boundary locations, a confidence value for that boundary location. 

9. (Previously Presented) The method ofclaim 8, wherein determining a confidence 
value for a boundary location comprises determining the confidence value as: 

where: 

C(B^ ) is the confidence value for the Bk" boundary; 

(i, j) is the similarity value for the data file and they* data file; . 

& is the index value of detected boundary at a particular value for the input clustering 
sensitivity parameter Xlevel. 

10. (Previously Presented) The method of claim 8, further comprising determining, 
for at least the one determined boundary locations, at least one clustering sensitivity parameter 
value that maximizes the confidence value. 



AMENDMENT UNDER 37 C.F.R. § 1.116 
U.S. Application No. 10/729,915 ' 

11. (Currently Amended) A method for organizing a plurality of data files stored in a 
digital memory using meta-data, wherein at least one meta-data element is at least associated 
with a corresponding one of the plurality of data files, the method comprising: 

extracting from the memory meta-data elements of the plurality of data files; 

organizing the extracted meta-data elements into a single ordered set wherein the set is 
ordered consecutively based on values for one or more of the extracted meta-data elements 

calculating pair-wise difference between values of each of the extracted meta-data 
elements; 

obtaining a desired value of a clustering sensitivity parameter for analyzing the meta- 
data; 

multiplying each pair- wise difference by "the clustering sensitivity parameter to obtain a 
plurality of similarity values having granularity defined by the clustering sensitivity parameter^ 

detennining a structure within the meta-data elements by comparing, for at least a subset 
of the plurality of data files, the similarity values; and, 

storing the structure of the data files in a memory. 

12. (Previously Presented) The method of claim 1 1 , further comprising clustering the 
data files into groups using the determined structure of Hie meta-data elements. 

6' . 
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13. (Original) The method of claim 12, further comprising determining boundaries 
jfrom the determined clusters of data files, wherein the boundaries are located between the 
determined clusters of data files. 

14. (Previously Presented) The method of claim 13, further comprising: 
determining a similarity value by comparing at least some of the meta-data elements 

associated with data files in a first selected cluster of data files to at least some other meta-data 
elements associated with data files of in the first selected cluster of data files; and 

determining a dissimilarity value by comparing at least some of the meta-data elements 
associated with data files in a second selected cluster of data files to at least some of the meta- 
data elements associated with data files in a third selected cluster of data files, wherein the 
second selected cluster of data files and the third selected cluster of data files are different 
clusters of data files. 

15. (prf^maZ) The method of claim 14, further comprising: 

determining a value corresponding to a desired grouping of the clusters of data files based 
onthe differences of the simfiarity values and the dissimilarity- values. 

16. . (Currently Amended) A storage medium storing a set of program instructions 
executable on a data processing device and usable to organize a plurality of data files by using 
meta-data wherein at least one meta-data element is associated with each data file, the program 
comprising: 
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instructions for extracting, for at least some of the data files, at least one meta-data 

element associated with each of the some data files; 

instructions for organizing the extracted meta-data elements in a d e sired ord e r into a 
single ordered set wherein the set is ordered consecutively based on at least one of: 
chronological, alphabetical, numerical, or geographical ordering; 

instructions for calculating pair-wise difference between values of each of the extracted 
meta-data elements; 

instructions for inputting at least one clustering sensitivity parameter value, said 
clustering sensitivity parameter defining granularity of the clustering; 

instructions for multiplying each pair-wise difference by the clustering sensitivity 
parameter to obtain a plurality of similarity values having granularity defined by the clustering 
sensitivity parameter; and 

instructions for dividing the data files into groups based on the similarity values of the 
extracted meta-data elements. 

17. Canceled. 

18. (PrmoM5/>' Pre5entei^) The storage medium of claim 16, further comprising 
instructions for determiniiig, for each of the at least some data files, at least one novelty value for 
that data file based on at least one similarity value of that data file and a nearby data file. 
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1 9. (Previously Presented) The storage medium of claim 1 6, wherein instructions for 

determining the similarity value of the one pair comprises instructions for evaluating the 

similarity value as: 



I \t,-t 



K 



where: 

the one pair consists of the z**^ data file andj*- data file; 

(z, is the similarity value for the data file and the J*** data file; 
ins the clustering sensitivity parameter value; 

ti is the value of a selected extracted meta-data element associated with the z* 
data file; and 

is the value of a selected extracted meta-data element associated with thej* 

data file.. 

20. {Previously Presented) The storage medium of claim 1 6, wherein instructions for 
determining the similarity value of the one pair comprises determining the similarity value of the 
onepairas: 

'^A'0'>7) = exp| 



A'- KIKI 



where: 
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■flie one pair consists of the data file andy* data file; 

(i, j) is the similarity value for the data file and they*'' data file; 
K is the clustering sensitivity parameter value; and 

V/ and vy^aifc are actual vector values determined firom the i**" and the data files. 

21 . (Previously Presented) The storage medium of claim 1 8, further comprising 
instructions for determining at least one boundary location between the some_data files based on 
the at least one novelty values determined for the each of the at least some data files. 

22. (Previously Presented) The storage medium of claim 1 8, wherein instructions for 
determining at least one novelty value comprises instructions for determining the at least one 
novelty value as: 

v^(s)=^S^(s+l,s^n)g(l,n) 
1^ ■ 

where: 

Vii:(if).isthenovelty value; and . 

g is tiie Gaussian tapered (21+1) x (2n+l) checkerboard kernel. 

23. (Previously Presented) Thestora:g;emediumof claim 21, further comprising 
instructions for determining, for at least the one determined boundary locations, a confidence 
value for that boundary location. 
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24. (Original) The storage medium of claim 23 , wherein instructions for determining 

at least one confidence value comprises instructions for determining each of such confidence 

value as: 




where: 

C(5if) is. the confidence value for the boundary; 

Sk (hf) is the similarity value for the i* data file and the j* data file; 

b is Uie detected boundary at a level. 

25. (Previously Presented) The storage medium of claim 23, fijrther comprising 
instructions for determining, for at least the one determined boundary location, at least one 
clustering sensitivity parameter value that maximizes the confidence value. 

26. (Currently Amended) A data file organizing computerized system usable to 
organize a plurality of data files using meta data having at least one meta data element that is at 
least associated with a corresponding one of the data files, comprising: 

an input link for receiving the data files; 

a meta-data extracting circuit, Toutine, or application that extracts, for at least some of the 
data files, at least one meta-data element associated with that data file; 
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• a meta-data organizing circuit, routine or application that organizes the extracted meta- 
data elements in a desired order based on values for the extracted meta-data elements; 

a similarity value determining circuit, routine or application that! 

selects at least one clustering sensitivity parameter value, said clustering 
sensitivity parameter defining granularity of clustering; 

calculates a similarity value for at least two of the plurality of data files by 
performing a pair- wise comparison of the extracted meta-data elements and multiplying 
the pair-wise comparison by the clustering sensitivity parameter value; 

a novelty value determining circuit, routine or application that determines at least one 
novelty value for that data file based on the at least one similarity value for that data file and for 
a number of nearby data files; 

a data dividing determining circuit, routine or application that divides at. least some of the 
data files into groups based on the extracted meta-data elements and the input clustering 
sensitivity parameter value by determining at least one boundary location between ones of the 
plurality of data files based on the at least one novelty value determined for at least some of Hie 
data files; and 

a confidence value determining circuit, routine or application that detennines, for at least 
some of the determined boundary locations, a confidence value for that bouiidary location, 
wherein the data dividing circuit, routine, or application further determines, for at least some of 
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the determined boundary locations, the at least one clustering sensitivity parameter value that 
maximizes the confidence value; and, 

an output link for outputting the meta-data elements in a desired grouping order. 

27. (Previously Presented) The method of claim 1, wherein determining the 
similarity value of the one pair comprises a term depending on a scalar magnitude of a difference 
between ti and tj relative to K, where K is the clustering sensitivity parameter value, i references 
a first data file of the one pair, j references a second file of the one pair, ti is the value of a 
selected extracted meta-data element associated with the first data file, and tj is the value of a 
selected extracted meta-data element associated with the second data file. 

28. {Previously Presented) The method of claim27 wherein the depending comprises 
an exponentially decreasing fimction of the scalar magnitude of the difference between ti and tj 
relative to K. 

29. {Previously Presented) The method of claim 1, wherein determining the 
similarity value of the at least one pair of the selected data files comprises a term depending on 
an inner product of and vj relative to K, where K is the clustering sensitivity parameter value, Vt 
is an actual vector value determined from the i* data file, and v,- is an actual vector value 
detennined from the j* data file. 
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